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DESCRIPTION 

PROGRAM, METHOD, AND APPARATUS FOR DECODING CODED DATA 

5 TECHNICAL FIELD 

The present invention relates to a program, a method, and an apparatus for 
decoding coded data, which allows an input of a signal coded by lossy compression 
and an output of a decoded signal thereof. 

10 BACKGROUND ART 

When information of an image or a voice is transmitted/stored as a digital 
signal, a coding technology that uses its redundancy to reduce a data volume is 
widely used. In this case, if distortion is permitted to a reproduced signal based on 
the fact that human audio-visual characteristics include insensitiveness to small 

15 distortion, compression can be performed at a higher level. Such a coding method 
is called lossy coding because an original signal is not accurately reproduced. 

A method of decoding data coded by lossy coding can generally be realized 
by performing its inverse processing when a coding method is provided. FIG. 8 is a 
block diagram showing a decoder of JPEG (Joint Photographic Experts Group) 

20 coded data widely used for, e.g., image coding, which is disclosed in p. 21 of IEEE 
Transactions on Consumer Electronics, Feb. 1992. The decoder shown in FIG. 8 
includes a variable length decoder 80 for receiving a coded stream, an inverse 
quantizer 81, an inverse DCT (Discrete Cosine Transform) converter 82, a limiter 83, 
and an integer-conversion unit 84. While not disclosed in the above document, the 
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limiter 83 and the integer-conversion unit 84 are generally widely used, and added to 
description below as they are important. 

Next, an operation will be described. The JPEG coding is designed to 
subject the original signal to DCT conversion by a block unit, thereby coding its 
5 quantized index. Thus, the decoder only needs to perform its inverse process. In 
FIG. 8, the variable length decoder 80 decodes a JPEG stream into a quantized 
index of a DCT coefficient. The inverse DCT converter 82 subjects the DCT 
coefficient to inverse conversion to reproduce the original signal. An output of the 
inverse DCT converter 82 is a real number. When the reproduced signal is digitally 

10 represented, it must be converted into a discrete signal of a certain range. This 
conversion is performed by the limiter 83 and the integer-conversion unit 84. The 
limiter 83 clips the output of the inverse DCT converter 82 in an existing range, and 
the integer-conversion unit 23 converts its result into an integer. The clipping and 
the integer-conversion may be reverse in order. As a result, for an output of the 

15 integer-conversion unit 84, an output of a form to be represented as digital data is 
obtained. The process is performed by block units, and all blocks are integrated to 
obtain decoded data at the end. 

The process of the limiter 83 and the integer-conversion unit 84 is 
many-to-one mapping. Thus, the original DCT coefficient cannot be restored from 

20 its output. This means that information of a part of the quantized index of the DCT 
coefficient represented as the stream is lost by decoding. 

This information loss has not posed a big problem as long as an image to be 
reproduced exists only to be consumed by a human. In other words, such an error 
introduced by decoding has not been a problem because it is too small to be sensed 



by the human. However, communication diversity has been accompanied by 
recognition that it is important to hold a structure of a coded image. For example, 
for secondary use, a decoded image may be recoded losslessly, information such as 
an electronic watermark may be contained in the image, and thus the information 
5 must be held even after decoding. A conventional decoding method cannot 
guarantee lossless receding or a remaining watermark. 

The conventional decoding method of coded data has a problem that when 
an error is introduced as a result of clipping or integer-conversion of an inversely 
converted signal, and a decoded signal is coded again, the same coded stream 
10 cannot be obtained. 

Furthermore, the error caused by the clipping or the integer-conversion 
poses a problem that information embedded in the quantized DCT coefficient is lost. 

The present invention has been made to solve the above problems, and has 
an object to provide a program, a method, and an apparatus for decoding coded data, 
15 which are capable of completely restoring an original stream when decoded data is 
coded again. 

It is another object to provide a program, a method, and an apparatus for 
decoding coded data, which are capable of completely holding information embedded 
in a quantized DCT coefficient by maintaining a structure of a signal represented as a 
20 stream within a certain range of quantization accuracy. 

DISCLOSURE OF THE INVENTION 
An apparatus and a method for decoding coded data according to the 
present invention each include: a first projection means (or step) for receiving an 
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input of a signal coded by lossy compression and orthogonally projecting an optional 
real number vector on one convex aggregate X in a first vector space in which the 
decoded signal is present; convergence judgment means (or step) for judging 
convergence of convex projection through the first projection means (or step) and 
5 obtaining a real number vector x belonging to the aggregate X to output the same as 
a decoded signal when the convergence of the convex projection is judged; and a 
second projection means (or step) for orthogonally projecting an optional vector of 
the first vector space on one convex aggregate Y in the second vector space different 
from the first vector space when the convergence of the convex projection is not 
10 judged, and then moving to the first projection means (or step) to repeat orthogonal 
projection on the aggregate X and the aggregate Y with the coded signal set as an 
initial value. 

Furthermore, the program for decoding coded data according to the invention 
serves to cause a computer to function as each means of the decoder of the coded 
15 data. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a schematic diagram showing an example of a change in value of a 
quantized index caused by clipping of decoded conversion coefficient to explain an 
20 approach of the present invention; 

FIG. 2 is a diagram showing a situation of sequentially obtaining vectors x 0 , 
xi, x n for two aggregates X and Y of FIG. 1 by repeating orthogonal projection in 
the aggregate X and the aggregate Y with y c set as an initial value to explain a first 
embodiment of the present invention; 
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FIG. 3 is a diagram showing another convex aggregate W (= kY) constituting 
a partial aggregate of the aggregate Y of FIG. 2; 

FIG. 4 is a block diagram showing a case where a decoder of coded data as 
hardware for explaining decoding program and method of the coded data according 
5 to the present invention is constituted of a computer; 

FIG. 5 is a flowchart showing contents of the decoding program of the coded 
data which is stored in an ROM 4B of FIG. 4 to operate based on control of a CPU 
4A; 

FIG. 6 is a block diagram showing a functional configuration of the decoder of 
10 the coded data of the present invention corresponding to the flowchart shown in FIG. 

5; 

FIG. 7 is a diagram showing a situation of obtaining a vector x n which is a 
common element by repeating convex projection between the aggregate X and the 
aggregate Y with a vector y c obtained by inverse-quantizing a quantized index of a 
15 transmitted DCT coefficient by Q c set as an initial value to explain a second 
embodiment of the present invention; and 

FIG. 8 is a block diagram showing a conventional decoder of coded data. 

BEST MODES FOR CARRYING OUT THE INVENTION 
20 The best modes for carrying out the present invention will be described in 

detail below with reference to the accompanying drawings. 

First, problems to be solved by the invention and approaches to the problems 
will be specifically described. Below, for convenience, JPEG coding that 
compresses image data will be described as a target. However, application of the 



* invention is not limited to this. 

FIG. 1 schematically shows an example of changing a value of a quantized 
index by clipping a decoded conversion coefficient. In FIG. 1, y c is a vector of a 
quantized DCT coefficient, i.e., a known vector during decoding. This vector is 
5 obtained in an output of an inverse quantizer 81 of FIG. 8. 

Y denotes an aggregate of vectors of DCT coefficients to be quantized. The 
y c converted into a signal of a time area by the inverse DCT converter 82, and then 
clipped in a range of a fixed value by a limiter 83. X denotes an aggregate of 
vectors in the range of such a value. A process of the limiter 83 is regarded as a 
10 process of subjecting the inverse-DCT y c to orthogonal projection on the aggregate X. 
In FIG. 1, an output vector of the limiter 83 is denoted by x 0 . 

As this vector is a real number vector, it is converted into an integer to finally 
obtain a decoded vector Xd. In this case, the vector Xd is decoded outside the 
aggregate Y by clipping. This means that when the vector x d is coded again, its 
15 quantized vector does not match the y c , and information of a DCT coefficient 
quantized by decoding is lost. 

This invention uses a convex projection method for decoding a coded stream 
to solve the problem. The convex projection method is a method of obtaining a 
common solution by repeating orthogonal projection on the aggregate X and the 
20 aggregate Y starting from an optional initial value when the two aggregates X and Y 
are each characterized by belonging to a convex aggregate, i.e., points or their 
aggregate on a line having two elements belonging to an aggregate set as points of 
both ends. 

FIG. 2 shows a situation of sequentially obtaining vectors x 0 , Xi, x n by 



repeating orthogonal projection on the aggregate X and the aggregate Y with y c set 
as an initial value for the two aggregates X and Y of vectors in which a range to be 
taken by each component of FIG. 1 is limited. By repeating this operation, Xj 
approaches the vector x n belonging in common to the aggregates X and Y without 
5 limit. In this case, the aggregates X and Y are obviously convex aggregates as they 
are defined as aggregates in which their components are in the fixed range. 
Accordingly, a common solution x n can always be obtained. 

When an integer vector closest to the x n is Xd, if the x d is an element of the 
aggregate Y (Xd is always an element of the aggregate X), information that the coded 

10 quantized vector has had is saved. It is because its quantized vector matches the y c 
if this vector is coded again. 

When the vector x d is not an element of the aggregate Y, another integer 
vector must be searched. FIG. 3 shows another convex aggregate W (= kY) which 
is not an aggregate Y itself but constitutes its partial aggregate. In this case, the 

15 aggregate W is defined as an aggregate of vectors in which a range of values to be 
taken by components is limited, e.g., an aggregate or the like in which a range of 
each component of the aggregate Y is reduced by k (k < 1) times. It can be 
understood that another vector x n present in a deeper part of X n Y is searched by 
repeating orthogonal projection between the aggregate X and the aggregate W with 

20 y c set as an initial value. 

Thus, when this vector is converted into an integer vector to be set as x d , a 
probability that this belongs to the aggregate Y becomes high. From the above 
discussion, it can be understood that by searching a vector based on convex 
projection while reducing a value of k, an integer vector present in common to the 



aggregate X and the aggregate Y can be obtained with high probability. 
First Embodiment 

FIG. 4 is a block diagram when a decoder of coded data as hardware to 
5 explain a program and a method for decoding the coded data according to the 
present invention is constituted of a computer. The hardware configuration of the 
decoder 4 of the coded data shown in FIG. 4 includes a CPU 4A, an ROM 4B for 
storing a decoding program of the coded data as a processing program, fixed data or 
the like, an RAM 4C for temporarily storing processing data, input data or the like, 

10 and an input/output unit 4D. A signal coded by lossy compression is input, 
processed based on control of the CPU 4A according to the decoding program of the 
coded data stored in the ROM 4B, and a result thereof is output as a decoded signal. 

FIG. 5 is a flowchart showing the decoding program of the coded data stored 
in the ROM 4B and operated based on the control of the CPU 4A. In FIG. 5, in an 

15 initialization step S51, a quantized index y q of a received DCT coefficient is 
inverse-quantized to obtain an initial vector y for convex projection. A value of k is 
set to 1. In other words, a process represented by the following expressions is 
carried out. 

y <- cr 1 (y q ) 

20 k<-1 

CT 1 denotes computation of inverse quantization. 
In this case, a value of y is held as y c as it is necessary in a later step. 
Next, in a projection step S52, the vector y is subjected to inverse-conversion 
to be orthogonally projected on the aggregate X. In this case, specifically, for the 
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orthogonal projection on the aggregate X, each component of a vector obtained by 
inverse-converting y needs to be clipped in a range from a to b (a < b). More 
specifically, a component- smaller than a is changed to a, a component larger than b 
is changed to b, and other components are kept as they are. In a typical case 
5 where an image signal is represented by 8 bits, a = 0, b = 255, or the like may be set. 
This process is represented by the following expression, 
x <- Px (T 1 (y)) 

T 1 (•) denotes inverse-conversion, and P x (•) denotes orthogonal projection 
on the aggregate X. 

10 Next, in a convergence judgment step S53, checking is made as to whether a 

vector has been changed or not by the orthogonal projection on the aggregate X. If 
a change is small, the process proceeds to an integer-conversion step S55 
understanding that convex projection has converged, or to a projection step S54 if a 
change is large. In this case, a size of the change may be judged by, e.g., the 

15 following. 

|x-T 1 (y)|<s 

| | is Euclidean norm, and & is a positive number decided by accuracy of a 
computing circuit. 

Next, in the projection step S54, x is subjected to DCT conversion to obtain T 
20 (x) (T (•) denotes DCT conversion), and this is orthogonally projected on the 
aggregate kY. In this case, in the orthogonal projection, each component of T (x) 
only needs to be corrected within a fixed range so that it can become equal to y c 
when the T (x) is quantized. This correction can be executed as in the case of the 
projection step S52. The process of the step S54 is represented by the following. 



y P kY (T (x)) 

P kY (■) is orthogonal projection on the aggregate kY. 

When convergence of convex projection is obtained in the convergence 
judgment step S53, x is converted into an integer vector in the integer-conversion 
5 step S55. 

Next, in an end judgment step S56, checking is made as to whether this 
integer vector belongs to the aggregate Y or not. If it belongs to the aggregate Y, 
the decoding is finished with the vector set as an output vector. This end judgment 
only needs to check whether its quantized result coincides with y c or not after the 
10 integer vector is subjected to DCT conversion. In other words, x is an element of 
the aggregate Y if the following equation is established: 

Q (T (int (x)) = y c 

In this case, int (■) is integer conversion, and Q (•) is an operator of quantization. 

If the equation is not established, a value of k is reduced in a reduction step 
15 S57, x is orthogonally projected on the aggregate kY in a projection step S58, and 
then the process returns to the orthogonal projection step S52 to repeat convex 
projection. Here, the processing of the reduction step S57 may be performed by 
using a real number r smaller than 1 as follows. 

k<-rk 

20 The processing of the projection step S58 is completely similar to that of the 
projection step S54. 

FIG. 6 is a block diagram showing a functional configuration of the decoder of 
the coded data according to the present invention corresponding to the flowchart 
shown in FIG. 5 to explain the program and the method for decoding the coded data. 



As shown in FIG. 6, the functional configuration of the decoder of the coded 
data includes a first projection means 61 for receiving an input of a signal coded by 
lossy compression, and orthogonally projecting an optional real number vector on 
one convex aggregate X in a first vector space in which the decoded signal is present, 
5 convergence judgment means 62 for judging convergence of convex projection 
through the first projection means 61, and obtaining a real number vector x belonging 
to the aggregate X to output it as a decoded signal when the convergence of convex 
projection is judged, and a second projection means 63 for orthogonally projecting an 
optional vector of the first vector space on one convex aggregate Y in a second 

10 vector space different from the first vector space when the convergence of convex 
projection is not judged, and then moving to the first projection means 61 to repeat 
orthogonal projection on the aggregate X and the aggregate Y with the coded signal 
set as an initial value. 

Additionally, the decoder of the coded data includes integer conversion 

15 means 64 for converting a real number vector belonging to the aggregate X into an 
integer vector when the convergence of the convex projection is judged by the 
convergence judgment means 62, end judgment means 65 for judging whether the 
integer vector belongs to the aggregate Y or not and outputting the integer vector as 
a decoding vector when it is judged to belong to the Y aggregate, reduction means 

20 66 for reducing the aggregate Y to generates a new convex aggregate W which is its 
partial aggregate when the integer vector is judged not to belong to the aggregate Y, 
and a third projection means 67 for orthogonally projecting the converted integer 
vector on the convex aggregate W, then moving to the first projection means 61, and 
executing orthogonal projection between the aggregate W and the aggregate X to 
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repeatedly correct the real number vector x with the coded signal set as an initial 
value. 

In this case, each functional configuration shown in FIG. 6 corresponds to 
each step in the flowchart shown in FIG. 5. In other words, the first projection 
5 means 61 corresponds to the projection step S52, the convergence judgment means 

62 corresponds to the convergence judgment step S53, the second projection means 

63 corresponds to the projection step S54, the integer conversion means 64 
corresponds to the integer conversion step S55, the end judgment means 65 
corresponds to the end judgment step S56, the reduction means 66 corresponds to 

10 the reduction step S57, and the third projection means 67 corresponds to the 

projection step S58. 

According to the method of this invention, its convergence performance is not 

proved. In other words, the end may not be judged in the end judgment step S56 

even if k is set sufficiently small, and an integer vector present in common in the 
15 aggregate X and the aggregate Y may not be found. This easily occurs particularly 

when quantization is fine. It is because a size of the aggregate Y is reduced more 

as the quantization is finer, and as a result, integer vectors contained in the common 

aggregate XnY are reduced in number. 

Thus, this invention provides a decoding method for defining a quantized 
20 vector of accuracy different from that of a coded quantized vector and holding 

information of a DCT coefficient coded within a range of the quantization accuracy 

especially when quantization is fine. 

Second Embodiment 
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According to a second embodiment of this invention, a program, a method, 
and an apparatus for decoding coded data are similar in configuration to those of the 
first embodiment shown in FIGS. 4 to 6. However, different processing must be 
carried out in some steps. 

According to the second embodiment, a quantized vector is used for judging 
maintenance of a structure of coded data different from a quantized vector for coding. 
This quantized vector is denoted by Q w , and the quantized vector for coding is 
denoted by Q c . According to the first embodiment, Q w = Qc (=Q) is established. An 
aggregate X is defined as in the case of the first embodiment. An aggregate Y is 
defined as an aggregate of DCT coefficient vectors so that its result can become y w 
when it is quantized by Q w . Qw is set so that a sufficient number of integer vectors 
are present in the aggregate Y. 

FIG. 7 shows a situation of obtaining a vector x n which is a common element 
by repeating convex projection between the aggregate X and the aggregate Y with a 
vector y c obtained by inverse-quantizing a quantized index of a transmitted DCT 
coefficient by Q c set as an initial value. In this case, y c is an element of the 
aggregate Y. As shown in FIG. 7, according to the second embodiment where Q w 
and Q c are different from each other, an initial value of a vector y is obtained by using 
Q c> and the aggregate Y is defined by using Q w . 

Hereinafter, referring to FIG. 5, steps different from those of the first 
embodiment will be specifically described. In the initialization step 51, with y set as 
an initial value, an initial value of a vector y is obtained by using Q c . In other words, 
the following is calculated: 

y <- Qc" 1 (y q ) 
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k<- 1 

Here, y q is a quantized index of a received DCT coefficient. For subsequent 
judgment, a vector where y is quantized/inverse-quantized is obtained as follows: 
y w <- Qw" 1 (Qw (y)) 

5 The projection step S52, the convergence judgment step S53, the projection 

step S54, and the integer conversion step S55 are similar to those of the first 
embodiment. In the end judgment step S56, by using Q w as a quantization 
parameter, whether the following equation is established or not is checked to judge 
and end: 

10 Qw (T (int (x)))) = yw 

Here, x is an output vector of the projection step S52. 

The reduction step S57 and the projection step S58 are similar to those of 
the first embodiment. 

Thus, according to the second embodiment, as a decoded vector is 

15 subjected to DCT conversion again, and a result of quantizing this by Q w matches a 
result of quantizing the original coded data by Q w , a structure of the coded data is 
held in a signal decoded within accuracy range of Q w - Accordingly, for example, 
information such as an electronic watermark can be left in the decoded data with 
accuracy of Q w . 

20 By making the quantization parameter Q w proper, integer vectors present in X 

o Y can be obtained in most cases. This can be demonstrated by an experiment. 
Table 1 shows an experiment result showing that solutions are obtained in all blocks 
when two image data subjected to JPEG coding are decoded. 

In this case, Q w (vector of quantized width) is 8 with respect to all DCT 



coefficients. In Table 1 , n denotes the number of times of correcting k, and a value 
of k corresponding to n is shown in a second column. The experiment was carried 
out by setting three kinds of Q c and changing coding accuracy. 



5 Table 1 
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11 
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0.857 


6 


24 


0 


7 


20 


0 


4 
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86 
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81920 


81920 


81920 


2304 


2304 


2304 



The coding accuracy is represented by a parameter q. A value of Q c is 
smaller as a value of q is smaller, and coding accuracy is high. Q w = Qc is 
established with q = 1.0. An image 1 is a natural image, and an image 2 is a test 

10 image of wedge design. The total numbers of DCT blocks of these images are 
respectively 81 920 and 2304. 

By using the parameter, decoding was carried out as shown in the flowchart 
of FIG. 1, and the number of blocks in which an end is judged at an n-th time in the 
end judgment step S56 was counted. This number is written in a third column and 

15 after. For example, it is shown that when the image 1 coded with q = 1 .0 is decoded, 
among 81920 blocks, decoding is finished with k = 1 at a block of 81787, and the 
numbers of blocks ended with k = 0.950, k = 0.903, k = 0.857, and k = 0.815 are 
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respectively 77, 48, 6, and 2. In this case, it can be understood that decoding is 
finished for all the blocks before k = 0.815. Similarly, it can be understood that for 
other parameters, solutions are obtained for all the blocks before k = 0.663. 

By taking the example of JPEG coding, the image coding that uses the DCT 
5 conversion has been described. This invention is not limited to the image coding, 
but it can be applied to coded data such as a voice. By properly providing a 
aggregate Y, the invention can be applied not only to the DCT conversion but also to 
other coding systems such as wavelet conversion. Furthermore, the invention can 
be applied not only to coding of a static image but also to coding of a moving image 
10 such as MPEG. 
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